The research was originally discussed after a session at the HBES meetings in 2016 and then discussed again after the cultural evolution workshop last year in Leiden. The idea was fairly straight forward, to take a generative model of cultural change and estimate the origination and extinction rates under different model settings. In other words, this work represents a sort of sensitivity analysis of macro-evolutionary measures of micro-evolutionary dynamics. The model considered here contains the following parameters:
The model of cultural change proceeds by assuming initially that there are \(N(0)\) cultural variants from a different type. Then, some cultural variants are chosen randomly \(u(t)\) and removed from the population according to the \(r\) parameter. Following this, cultural variants are added back into the population according to a given probability.
A burn-in period of 500 time steps is automatically established for parameters to be kept constant and for the cultural system to reach stationarity. After, the burn-in period these previous steps are repeated for the \(t_{max}\) steps and record the frequencies of all variant types at time \(t\), \(t=1,....,t_{max}\).
Broadly, three analytical categories were established with the following parameters:
Default values:
\(t_{max}=50\)
\(c_b = 0.3\)
\(r = 0.3\)
Analysis 1: Constant Population Size; Constant Transmission
\(N=[100,1000]\)
\(\mu=[0.01,0.1]\)
\(b=[-0.03,-0.01,0,0.01,0.03]\)
Analysis 2: Changing Population Size; Constant Transmission
\(N(0),N(t_{max}) = [(100,500), (500,100), (1000,5000), (5000,1000)]\)
\(\mu=[0.01,0.1]\)
\(b=[-0.03,-0.01,0,0.01,0.03]\)
Analysis 3: Constant Population Size; Changing Transmission
\(N=[100,1000]\)
\(\mu=[0.01,0.1]\)
\(b=[(-0.03,-0.01),(-0.03,0),(-0.03,0.01),(-0.03,0.03),(-0.01,-0.03),(-0.01,0),(-0.01,0.01),(-0.01,0.03),(0,-0.03),(0,-0.01),\\(0,0.01),(0,0.03),(0.01,-0.03),(0.01,-0.01),(0.01,-0.03),(0.03,-0.03),(0.03,-0.01),(0.03,0),(0.03,0.01)]\)
In total, these three analyses produce 140 different parameter settings. Initially, population sizes of 10,000 were also included in Analysis 1 and 3, but these resulting files sizes were extremely large (>20 GB) for some parameter settings and difficult to analyze with the existing computing power.
The research originated as a largely inductive exploration of how different parameter settings of a “micro-evolutionary” model of cultural change can be identified using “macroevolutionary” measures. Based on this, the following research questions guided this initial work:
What is the best way to measure changes in the diversity of frequency data through time?
Broadly speaking, three different approaches are evaluated here: These are:
How are each of these diversity measures impacted by different parameters of the cultural change model? Are certain diversity measures more sensitive to population size, mutation rate, or changes in the cultural transmission process?
This is a question linked with first question as to whether certain macro-evolutionary measure pick up different aspects of the model of cultural change. To be honest, there are some broad conclusions that can be likely made, but specific insights might be more difficult.
Can the same dynamics be inferred between methods that incorporate the abundance of variants (PyRate) and those that simply examine the richness of types (LiteRate)?
This is a question that I find very interesting and something that comes up often in macro-evolutionary research. Do we actually need to have high-resolution frequency data on the number of variants (or pottery styles for instance). Or, do we still get the same general picture of diversification if we just count the number of types (or pottery traditions or types). Furthermore, under what conditions can we just use the richness of types (and/or the estimation of their rates) as an approriate proxy for the diversification dynamics.
Apologies for the scale of these plots, I can provide larger and higher resolution ones if necessary. In addition, these are just a few examples, I have plots for all 140 different parameter settings if necessary.
Analysis 1 Stationarity is generally achieved under constant transmission and constant population size. There are often some edge effects in time steps 50 and 1 (which have been removed here). This is largely because I forgot to turn on the setting to remove this time bin during the analysis. As a note, this is a much better result that our attempts at this project three years ago, when PyRate / LiteRate estimates were extremely variable. Since this time, both these methods have a different and much better MCMC algorithm implemented, specifically a RJMCMC as opposed to a Birth-Death MCMC.
Analysis 2 Diversity indices (Row 1) all appear to be sensitive to differences in population sizes and the resulting changes in type and variant diversity. However, they are clearly impacted by the magnitude of the differences in diversity. In particular, a lower strength of frequency-dependent selection generally leads to a more rapid increase in overall richness when population rises, and more severe decline in richness when population declines. This rapid increase or decrease has clear impacts on diversity indices with Simpson and Shannon diversity providing significantly higher diversity values for lower frequency-dependent values (-0.03, -0.01, 0). Coefficient of variation (sd/mean) seems to be influenced by higher values of frequency-dependent selection (0.01, 0.03) indicating greater dispersion in these scenarios.
Analysis 2 Rates of origination, extinction, net diversification (Origination - Extinction), and longevity (1 / extinction), all remain stable when the frequency-dependent selection variable remains stable, irregardless of population size changes. This is most indicative when population sizes increase or decrease the richness increases or decreases linearly. When population sizes are increasing, origination rates always remain higher then extinction rates (as expected), and net diversification rates always remain positive. When population sizes decrease, extinction rates for each value of \(b\) are always higher then their corresponding origination rate. Once again, this is expected and net diversification rates are always negative in the situations. As a note, the graphs above for net diversification often have very small scales, accentuating pretty small differences in origination and extinction in the plots shown above.
Analysis 3 Changing the frequency dependent strength of selection radically changes all measures used in this study. When moving from anti-conformity (\(b=-0.03\)) to higher levels of conformity, diversity declines across the board. Interestingly, origination rates and extinction rates both increase whenever moving from anti-conformity to conformity. However, the overall decline in diversity is driven by a greater increase in the extinction rate relative to the also increasing origination rate. It also appears that the most radical scenario (\(b=-0.03\) to \(b=0.03\)) may take the longest time to re-stabilize and may not be captured in our 25 time steps. A similar pattern can also be recognized when moving from strong frequency dependent selection to lower values. Here, both origination and extinction rates decline quickly, with the exception of moveing from strong to moderate frequency dependent selection, i.e. \(b1=0.03, b2=0.01\). The extinction rates also appear to decline more quickly and stabilize at a lower value, meaning the total net diversification remains positive.
What is the best way to measure changes in the diversity of frequency data through time?
There isn’t a definitive answer to this question as different diversity measures pick up different aspects of the data (i.e. Question 2). Calculating rates definitely removes much of the noise (as is the goal of PyRate / LiteRate) and does not appear to be strongly influenced by the linear increase or decrease of richness due to population changes. I suppose this could be argued as a positive, since we often have poor estimates of population size, or a negative, as we want to know whether changes in population may have had a significant influence on rates. However, diversity indices are strongly influnced by population differences.
How are different diversity measures impacted by different parameters?
As highlighted above, diversity indices and diversification rates do highlight different aspects of the diversification history. What isn’t presented here is the second part of the PyRate / LiteRAte analysis which identifies statistically signficant shift points in the origination and extinction rates. This is one of the main difficulties with using diversity indices, it can be hard to develop conclusions on whether a small increase or decrease is signficant. These results can easily be added in if necessary.
Can the same dynamics be inferred between methods using rates that estimate based on abundances of types and those that just use types?
This can be examined by looking at the differences between rows 2 and 3 in each of the plots. In almost all cases, the general trend of the rates is the same for both analyses. There are some differences in the exact rate values but the trends are nearly identitical. This is at least suggestive that when using rates as a diversification measure, weighting the types by the number of variants produces only minor benefits, such as decrease rate uncertainty.
Where does this research go from here and is there enough here to write a paper for PhilTrans?
Right now, these results seem to interesting, but certainly not ground-breaking results. Are there key concepts or questions that should be considered in this that currently are not? This might include sampling type / variant data and running the same analysis to explore what an “archaeological” sample might look like. Or, the inclusion of identifying statistically significant shift points in rates, something that is available when using diversity indices.Or possibly running the analysis much longer? 50 time steps was chosen as this seemed like a more realistic archaeological number as opposed to 500 or 1000 time steps.
What does this actually tell us about cultural transmission?
The results presented here are more informative of how we measure diversity at different scales rather than providing insights into cultural transmission processes and patterns. Based on these results, what can we say about frequency-dependent selection that hasn’t already been discussed in other papers?
Micro vs. Macro
One of the points that the editors wanted us to explore was the relationship between micro and macro evolution. What aspects of this work are relevant to this broader discussion?